AITopics | load balancing

Collaborating Authors

load balancing

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

7f9220f90cc85b0da693643add6618e6-Paper-Conference.pdf

Neural Information Processing SystemsFeb-19-2026, 06:52:51 GMT

algorithm, prediction, predictor, (14 more...)

Neural Information Processing Systems

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > Sweden > Stockholm > Stockholm (0.04)
North America > United States > Virginia > Alexandria County > Alexandria (0.04)
(7 more...)

Genre: Research Report (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Algorithms with Prediction Portfolios

Neural Information Processing SystemsAug-16-2025, 11:14:30 GMT

Ideally, we would like the algorithm's performance to depend on the quality of the

artificial intelligence, machine learning, prediction, (15 more...)

Neural Information Processing Systems

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > Sweden > Stockholm > Stockholm (0.04)
North America > United States > Virginia > Alexandria County > Alexandria (0.04)
(7 more...)

Genre: Research Report (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Load Balancing for AI Training Workloads

McClure, Sarah, Ratnasamy, Sylvia, Shenker, Scott

arXiv.org Artificial IntelligenceJul-30-2025

We investigate the performance of various load balancing algorithms for large-scale AI training workloads that are running on dedicated infrastructure. The performance of load balancing depends on both the congestion control and loss recovery algorithms, so our evaluation also sheds light on the appropriate choices for those designs as well.

artificial intelligence, packet, workload, (14 more...)

arXiv.org Artificial Intelligence

2507.21372

Genre: Research Report > New Finding (0.68)

Industry: Energy > Power Industry (0.88)

Technology:

Information Technology > Communications > Networks (1.00)
Information Technology > Artificial Intelligence (1.00)

Add feedback

AI-Based Demand Forecasting and Load Balancing for Optimising Energy use in Healthcare Systems: A real case study

Rahimi, Iman, Patel, Isha

arXiv.org Artificial IntelligenceJul-9-2025

- This paper addresses the critical need for efficient energy management in healthcare facilities, where fluctuating energy demands challenge both operational and sustainability goals. Traditional energy management methods often fall short in healthcare settings, lead ing to inefficiencies and increased costs. To address this, the paper explores AI - driven approaches for demand forecasting and load balancing, introducing a novel integration of LSTM (Long Short - Term Memory), g enetic a lgorithm, and SHAP (Shapley Additive E xplanations) specifically tailored for healthcare energy management. While LSTM has been widely used for time - series forecasting, its application in healthcare energy demand prediction is underexplored. Here, LSTM is demonstrated to significantly outperfor m ARIMA and Prophet models in handling complex, non - linear demand patterns. Results show that LSTM achieved a Mean Absolute Error (MAE) of 21.69 and Root Mean Square Error (RMSE) of 29.96, significantly improving upon Prophet (MAE: 59.78, RMSE: 81.22) and ARIMA (MAE: 87.73, RMSE: 125.22), highlighting its superior forecasting capability. Genetic algorithm is employed not only for optimising forecasting model parameters but also for dynamically improving load balancing strategies, ensuring adaptability to real - time energy fluctuations. Additionally, SHAP analysis is used to interpret the models and understan d the impact of various input features on predictions, enhancing model transparency and trustworthiness in energy decision - making. The combined LSTM - GA - SH AP approach offers a comprehensive framework that improves forecasting accuracy, enhances energy efficiency, and supports sustainability in healthcare environments. Future work could focus on real - time implementation and further hybridisation with reinforc ement learning for continuous optimisation. This study establishes a strong foundation for leveraging AI in healthcare energy management, showcasing its potential for scalability, efficiency, and resilience. Introduction Australia has a big capacity of using renewable energy in different regions ( Holloway, R, 2023; Rahimi et al., 2025) . Australian healthcare system plays a major role in using renewable energies. Optimising energy use in healthcare systems is essential due to the high and often unpredictable energy demands needed to run medical equipment, keep environmental conditions stable, and support constant patient care.

artificial intelligence, machine learning, prophet, (18 more...)

arXiv.org Artificial Intelligence

2507.06077

Country:

Asia (1.00)
Oceania > Australia (0.88)
North America > Trinidad and Tobago > Trinidad > Arima > Arima (0.52)

Genre: Research Report > New Finding (1.00)

Industry:

Health & Medicine (1.00)
Energy > Power Industry (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Interpretable Reinforcement Learning for Load Balancing using Kolmogorov-Arnold Networks

Singh, Kamal, Marouani, Sami, Sheikh, Ahmad Al, Quang, Pham Tran Anh, Habrard, Amaury

arXiv.org Artificial IntelligenceMay-21-2025

As load and delta load increase, the policy puts more flows on the Internet link. Increasing Internet delay puts the flows on MPLS. The contribution of Internet loss seems counter intuitive as it seems to put more load on Internet Link. However, even if its coefficient is near to 1.0, the overall contribution of the term is negligible as compared to load because loss in our scenario varies from 0 to around 0.15. This applies to delay too. For minimising loss, we extract the following: a 1. 9 1 .1( 2 λ 3 + 1) 2 2λ i 5 + 10 d i 3 + u i 10 (4) This policy can be interpreted as follows, and we may refer to Figure 1 as well. The ratio starts near 0.8 and increasing load, with increasing delta, puts more traffic on Internet link. Increasing Internet delay and Internet link utilisation slightly shifts the balance towards putting more traffic on MPLS link. Distillation of symbolic equations of PPO policy: In this method, we train policy using PPO, generate trajectory data and then generate the symbolic equations using auto-regressive models [22].

machine learning, reinforcement learning, traffic, (18 more...)

arXiv.org Artificial Intelligence

2505.14459

Genre: Research Report (0.40)

Industry: Energy > Power Industry (0.43)

Technology:

Information Technology > Communications > Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.66)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Graph Reinforcement Learning for QoS-Aware Load Balancing in Open Radio Access Networks

Semiari, Omid, Nikopour, Hosein, Talwar, Shilpa

arXiv.org Artificial IntelligenceApr-29-2025

Next-generation wireless cellular networks are expected to provide unparalleled Quality-of-Service (QoS) for emerging wireless applications, necessitating strict performance guarantees, e.g., in terms of link-level data rates. A critical challenge in meeting these QoS requirements is the prevention of cell congestion, which involves balancing the load to ensure sufficient radio resources are available for each cell to serve its designated User Equipments (UEs). In this work, a novel QoS-aware Load Balancing (LB) approach is developed to optimize the performance of Guaranteed Bit Rate (GBR) and Best Effort (BE) traffic in a multi-band Open Radio Access Network (O-RAN) under QoS and resource constraints. The proposed solution builds on Graph Reinforcement Learning (GRL), a powerful framework at the intersection of Graph Neural Network (GNN) and RL. The QoS-aware LB is modeled as a Markov Decision Process, with states represented as graphs. QoS consideration are integrated into both state representations and reward signal design. The LB agent is then trained using an off-policy dueling Deep Q Network (DQN) that leverages a GNN-based architecture. This design ensures the LB policy is invariant to the ordering of nodes (UE or cell), flexible in handling various network sizes, and capable of accounting for spatial node dependencies in LB decisions. Performance of the GRL-based solution is compared with two baseline methods. Results show substantial performance gains, including a $53\%$ reduction in QoS violations and a fourfold increase in the 5th percentile rate for BE traffic.

machine learning, reinforcement learning, traffic, (16 more...)

arXiv.org Artificial Intelligence

2504.19499

Genre: Research Report (1.00)

Industry:

Telecommunications (1.00)
Energy > Power Industry (0.64)

Technology:

Information Technology > Communications > Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Meta-Reinforcement Learning with Discrete World Models for Adaptive Load Balancing

Redovian, Cameron

arXiv.org Artificial IntelligenceMar-11-2025

We integrate a meta-reinforcement learning algorithm with the DreamerV3 architecture to improve load balancing in operating systems. This approach enables rapid adaptation to dynamic workloads with minimal retraining, outperforming the Advantage Actor-Critic (A2C) algorithm in standard and adaptive trials. It demonstrates robust resilience to catastrophic forgetting, maintaining high performance under varying workload distributions and sizes. These findings have important implications for optimizing resource management and performance in modern operating systems. By addressing the challenges posed by dynamic and heterogeneous workloads, our approach advances the adaptability and efficiency of reinforcement learning in real-world system management tasks.

agent, learning, reinforcement, (12 more...)

arXiv.org Artificial Intelligence

2503.08872

Country:

North America > United States > New York > New York County > New York City (0.04)
North America > United States > Missouri > Cape Girardeau County > Cape Girardeau (0.04)
Oceania > Australia > New South Wales > Sydney (0.04)
(2 more...)

Genre: Research Report (0.50)

Industry: Energy > Power Industry (0.63)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

A transformer-based deep q learning approach for dynamic load balancing in software-defined networks

Owusu, Evans Tetteh, Agyekum, Kwame Agyemang-Prempeh, Benneh, Marinah, Ayorna, Pius, Agyemang, Justice Owusu, Colley, George Nii Martey, Gazde, James Dzisi

arXiv.org Artificial IntelligenceJan-22-2025

This study proposes a novel approach for dynamic load balancing in Software-Defined Networks (SDNs) using a Transformer-based Deep Q-Network (DQN). Traditional load balancing mechanisms, such as Round Robin (RR) and Weighted Round Robin (WRR), are static and often struggle to adapt to fluctuating traffic conditions, leading to inefficiencies in network performance. In contrast, SDNs offer centralized control and flexibility, providing an ideal platform for implementing machine learning-driven optimization strategies. The core of this research combines a Temporal Fusion Transformer (TFT) for accurate traffic prediction with a DQN model to perform real-time dynamic load balancing. The TFT model predicts future traffic loads, which the DQN uses as input, allowing it to make intelligent routing decisions that optimize throughput, minimize latency, and reduce packet loss. The proposed model was tested against RR and WRR in simulated environments with varying data rates, and the results demonstrate significant improvements in network performance. For the 500MB data rate, the DQN model achieved an average throughput of 0.275 compared to 0.202 and 0.205 for RR and WRR, respectively. Additionally, the DQN recorded lower average latency and packet loss. In the 1000MB simulation, the DQN model outperformed the traditional methods in throughput, latency, and packet loss, reinforcing its effectiveness in managing network loads dynamically. This research presents an important step towards enhancing network performance through the integration of machine learning models within SDNs, potentially paving the way for more adaptive, intelligent network management systems.

machine learning, reinforcement learning, temporal fusion transformer, (16 more...)

arXiv.org Artificial Intelligence

2501.12829

Country: Africa > Ghana (0.15)

Genre: Research Report > New Finding (0.48)

Industry:

Telecommunications > Networks (1.00)
Energy > Power Industry (1.00)

Technology:

Information Technology > Communications > Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Safe Load Balancing in Software-Defined-Networking

Dinh, Lam, Quang, Pham Tran Anh, Leguay, Jérémie

arXiv.org Artificial IntelligenceOct-22-2024

High performance, reliability and safety are crucial properties of any Software-Defined-Networking (SDN) system. Although the use of Deep Reinforcement Learning (DRL) algorithms has been widely studied to improve performance, their practical applications are still limited as they fail to ensure safe operations in exploration and decision-making. To fill this gap, we explore the design of a Control Barrier Function (CBF) on top of Deep Reinforcement Learning (DRL) algorithms for load-balancing. We show that our DRL-CBF approach is capable of meeting safety requirements during training and testing while achieving near-optimal performance in testing. We provide results using two simulators: a flow-based simulator, which is used for proof-of-concept and benchmarking, and a packet-based simulator that implements real protocols and scheduling. Thanks to the flow-based simulator, we compared the performance against the optimal policy, solving a Non Linear Programming (NLP) problem with the SCIP solver. Furthermore, we showed that pre-trained models in the flow-based simulator, which is faster, can be transferred to the packet simulator, which is slower but more accurate, with some fine-tuning. Overall, the results suggest that near-optimal Quality-of-Service (QoS) performance in terms of end-to-end delay can be achieved while safety requirements related to link capacity constraints are guaranteed. In the packet-based simulator, we also show that our DRL-CBF algorithms outperform non-RL baseline algorithms. When the models are fine-tuned over a few episodes, we achieved smoother QoS and safety in training, and similar performance in testing compared to the case where models have been trained from scratch.

artificial intelligence, machine learning, reinforcement learning, (19 more...)

arXiv.org Artificial Intelligence

2410.16846

Country:

North America > United States > New York > New York County > New York City (0.04)
North America > United States > Washington > King County > Seattle (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
(4 more...)

Genre: Research Report > New Finding (0.48)

Industry:

Telecommunications > Networks (0.94)
Information Technology (0.67)
Energy > Power Industry (0.64)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

Reinforcement Learning-Based Adaptive Load Balancing for Dynamic Cloud Environments

Chawla, Kavish

arXiv.org Artificial IntelligenceSep-7-2024

Efficient load balancing is crucial in cloud computing environments to ensure optimal resource utilization, minimize response times, and prevent server overload. Traditional load balancing algorithms, such as round-robin or least connections, are often static and unable to adapt to the dynamic and fluctuating nature of cloud workloads. In this paper, we propose a novel adaptive load balancing framework using Reinforcement Learning (RL) to address these challenges. The RL-based approach continuously learns and improves the distribution of tasks by observing real-time system performance and making decisions based on traffic patterns and resource availability. Our framework is designed to dynamically reallocate tasks to minimize latency and ensure balanced resource usage across servers. Experimental results show that the proposed RL-based load balancer outperforms traditional algorithms in terms of response time, resource utilization, and adaptability to changing workloads. These findings highlight the potential of AI-driven solutions for enhancing the efficiency and scalability of cloud infrastructures.

algorithm, cloud environment, server, (11 more...)

arXiv.org Artificial Intelligence

2409.04896

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
Europe > United Kingdom > England > Greater Manchester > Manchester (0.04)

Genre: Research Report > New Finding (0.89)

Industry: Energy > Power Industry (1.00)

Technology:

Information Technology > Cloud Computing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback